An improved ID area law for frustration-free systems 



Itai AratS 

School of Computer Science and Engineering, 
The Hebrew University, Jerusalem, 91904, Israel 

Zeph Landai|^ and Umesh Vaziranj^] 
University of California at Berkeley 
Berkeley, Calif ormna, 94720, USA 
(Dated: June 12, 2012) 

We present a new proof for the ID area law for frustration-free systems with a constant gap, which 
exponentially improves the entropy bound in Hastings ID area law and which is tight to within a 
polynomial factor. For particles of dimension d, spectral gap e > 0, and interaction strength of, 

at most, J, our entropy bound is Sid < ■ X^log^ X, where X (Jlogd)/e. Our proof is 

completely combinatorial, combining the detectability lemma with basic tools from approximation 
theory. In higher dimensions, when the bipartitioning area is \dL\, we use an additional local 
structure in the proof and show that S < 0(1) • \dL\^ log® \dL\ ■ X^ log* X. This is at the cusp of 
being nontrivial in the 2D case, in the sense that any further improvement would yield a subvolume 
law. 



I. INTRODUCTION 

One of the striking differences between quantum and 
classical systems is the number of parameters that are 
needed to describe them. A classical system of n particles 
can be generally described by 0{n) parameters, whereas 
an arbitrary state of a similar quantum system would 
generally require 2'^'^"^ parameters. This exponential gap 
is directly related to the phenomena of entanglement; 
quantum states do not have to be simple product states 
but can be an arbitrary superposition of such states. 

But how genuine is this exponential gap? Is it an ar- 
tifact of the fact that we are considering arbitrary quan- 
tum states, or is it an inherent characteristic of physical 
states that occur in nature, which are, of course, a much 
more restricted set of states? Among the best physical 
systems one may consider with respect to this question 
are quantum many body systems on a lattice, which are 
ubiquitous in condensed matter physics. These systems 
are generally described by a local Hamiltonian that mod- 
els the local interaction between neighboring particles. In 
particular, one is interested in the entanglement proper- 
ties of the ground state of these systems. If the system 
has a spectral gap e > 0, we can approach this state 
by cooling the system to temperatures below that gap. 
Such cold systems would be in an almost perfectly co- 
herent state, and one expects their quantum nature to 
be fleshed out most pronouncedly. What are then the 
entanglement properties of such states? 

Area laws constitute one of the most important tools 
for bounding entanglement in such systems.'^' Starting 
from Bekenstein's seminal result that the entropy of a 
black hole is proportional to the surface of its horizon,^ 
it was later conjectured that the origin of this entropy 
is the quantum entanglement between the inner part of 
the black hole and its surrounding.'^ * This conjecture 
has led researchers to consider the scaling of entangle- 
ment entropy in the ground (vacuum) states in models 



of quantum field theories,'^ where it was demonstrated 
that also in there, entropy scales like the surface area of 
the region rather than its volume, albeit with some log- 
arithmic corrections in critical cases. The same behavior 
was then demonstrated also in the context of spin chains 
by Vidal et al.'^ which has led to the formulation of the 
area-law conjecture that we now describe. 

Consider a local-Hamiltonian system of d-dimensional 
particles that sit on a D-dimensional grid, as demon- 
strated in Fig. ij We say that a state lip) of this system 
obeys an area law if, for any contiguous region L of the 
grid, the entanglement between the particles inside L and 
the particles outside L is upper bounded by a constant 
times the surface area of L as follows: 



SL{i^)<0{\dL\) . 



(1) 



The area-law conjecture then states that if the system has 
a constant spectral gap e > 0, its ground state necessarily 
obeys an area law. This is clearly much stronger than the 
trivial bound on the entropy (known as a volume law), 
which is proportional to the number of particles inside L. 

On the face of it, there are several reasons to believe 
that ground states of gapped local Hamiltonians obey an 
area law. One might intuitively expect that the entangle- 
ment is local because it is generated by local interactions. 
More formally, it has been shown by Hastings that in the 
ground state of gapped systems, the correlation between 
two local observables decays exponentially in their lattice 
distance.'^ It is, therefore, tempting to conclude that only 
the degrees of freedom near the boundary are entangled 
to those outside the region. However, the existence of 
data hiding state^^ implies that such simple reasoning is 
probably insufficient to prove an area law.l^ 

Indeed, even though the area-law conjecture was shown 
to be true in many specific models, mostly in ID (see, for 
example, Refs.lTllTllTTI and[T2land references therein), it 
was not until a few years ago, in a seminal paper ,^21 that 



2 




FIG. 1. An illustration of a quantum many-body system on 
a grid. The particles sit on the vertices and the edges denote 
2-body interactions. When the system is described by a state 
that obeys an area law, the entanglement entropy between 
the particles inside the region L and the particles outside of 
it is proportional to the surface area \dL\. 



Hastings proved that it holds for all ID systems with a 
spectral gap. In this case, an area law says that ground- 
state entropy across any cut in the ID chain is bounded 
by a constant, independent of the system size. From 
this, one can deduce that, for any practical purposes, 
the ground state of such systems can be described by 
a polynomial number of parameters (say, by a Matrix 
Product State), instead of an exponential number (see 
Ref.ini). 

Unfortunately, Hastings' ID proof does not scale up 
to higher dimensions. One reason lies in the form of the 
entropy bound. For a ID system with a bounded inter- 
action strength J, a spectral gap e > 0, and a particle di- 
mension d, the upperbound is 2'^*^^-', where X =^ JJ^d^ 
This exponential dependence on logd is catastrophic if 
we want to use this formula in higher dimensions by fus- 
ing together particles along surfaces parallel to dL. It 
implies an entropy bound of 2*-'(l^^l) instead of 0{\dL\). 
This is exponentially larger than the trivial volume-law 
bound of 0(|L|). Proving area laws in two or higher di- 
mensions therefore is still a wide open problem and a 
holy grail in quantum Hamiltonian complexity. 

A combinatorial approach to proving the area law for 
ID frustration free systems was introduced in Ref. HM 
The proof replaced Hastings' analytical machinery in- 
cluding the Lieb- Robinson bound and spectral Fourier 
analysis with the detectability lemma,'^ a combinato- 
rial lemma about local Hamiltonians. However, the re- 
sulting bound was no better than Hastings' bound be- 
cause, at its heart, the argument followed the same out- 
line as Hastings', including the use of a "mutual informa- 
tion saturation" -type argument (or, in effect, a kind of 
"monogamy of entanglement" -type argument) that leads 
to an exponential slack. Nevertheless, the combinatorial 
nature of the detectability lemma opened up the possi- 
bility of a new inherently combinatorial proof of the area 
law. 



A. Our results 

In this paper we give a new proof of the area law in ID 
for the case of gapped frustration-free Hamiltonians. Our 
proof yields an exponentially better bound on the entan- 
glement entropy than the bound in Ref. [T31 Specifically, 
we prove 

Theorem I.l Consider a ID, frustration-free local 
Hamiltonian system H = Hi, over a system of n par- 
ticles of local dimension d, with Hi being 2-local nearest- 
neighbor interactions with a bounded strength \\Hi\\ < J. 
Assume further that the system has a unique ground state 
\n) and a spectral gap e > 0, and define 



Then along any cut in the chain, the von-Neumann en- 
tanglement entropy between the two sides is bounded by 

SiD < 0{1) ■ X^Xog^ X . (3) 

The importance of this work is threefold. First, in ID, 
we exponentially improve the upper bound as a function 
of e^^: the bound in Eq. ([s]) implies an upper bound 
of ©(e^"^ log^ e^^). This is within a polynomial factor 
of recent lower bounds that were found by Hastings and 
GottesmarP^ and Irani.Ii^'i^ The former showed a ID 
system with a fixed d and J, in which the entropy along 
a cut is at least ©(e"^/*), and the latter showed a lower 
bound of 0{e^^/^'^) for a translation invariant system. 

Second, and more important, is the relevance of this 
work for proving area laws in higher dimensions. A naive 
application of our bound to higher dimensional systems 
would yield an entropy bound of 5* < |9-L|^poly(log |9-L|), 
which is still worse than a volume law, but is much 
better than the previous exponential bound. More- 
over, as we show in Sec. |Vj in two or more dimensions, 
one can further exploit the local properties of the sys- 
tem along the boundary dL and improve the bound to 
• poly (log |9L|). This bound is at the cusp of being 
nontrivial; any further improvement that would bound 
the entropy by \dL\^~^ for any 5 > Q, would prove a sub- 
volume law for 2D. In fact, this gives a subvolume law 
for the case of fractals with dimension 1 < Z? < 2, but 
we shall not pursue that direction here. 

Finally, our approach is differs from Hastings' original 
proof. Here, the "monogamy of entanglement" -type ar- 
gument is replaced with an iterative procedure to find 
product states with increasingly higher overlap with the 
ground state. The central quantity used to analyze the 
progress of this procedure is Schmidt rank, rather than 
more advanced tools like relative entropy and its mono- 
tonicity. While the iterative procedure is based on the 
detectability lemma, a much more intricate combinato- 
rial structure is necessary to ensure that the procedure 
makes progress. 

We now give a high-level overview of the proof. 
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B. High-level overview of the proof 

Consider a ID chain of n d-dimensional particles with 
nearest neighbor interactions, described by the gapped, 
frustration-free Hamihonian H = Qi with a unique 
ground state For the sake of simphcity, we assume 

that Qi are projections and, therefore, =^ 1 — Qi are 
projections to the local ground spaces of the different 
terms. 

The key to proving an area law across a cut, is to find 
a product state |0) — (81 \(f>j^) with respect to the 
bipartitioning of the system, which has a large overlap 
with Our approach to finding such a product state 
is to start with any product state with a nonzero overlap 
with \n) and act on it with an operator that increases 
its overlap with |f2), without increasing its Schmidt rank 
much. Specifically, we construct an operator K with the 
following property: K fixes but when applied to any 
state \ip),it shrinks the component orthogonal to by a 
factor of A while increasing the Schmidt rank of by, at 
most, a factor of D. Clearly, there is a race between these 
two factors D and A. It turns out that when D ■ A < 
1/2, we can amplify the overlap with by replacing 
If/*) = \4>l} 1 01:) with one of the Schmidt vectors of 
-ftr|(/>). This amplification continues all the way until the 
overlap is ^/TJ{2D). A few more applications of K to 
this product state yield a state with Schmidt rank D'^^^\ 
which has constant {D independent) overlap with 
Further applications of K give rise to Schmidt coefficients 
with vanishing mass, and, therefore, the entanglement 
entropy of |r2) can be bounded by 0{logD). 

The task of proving an area law therefore is reduced 
to the task of finding an operator K with _D • A < 1/2. 
Our starting point is the detectability lemma (DL). De- 
note the spectral gap by e > 0. We can partition the 
projections {Pi} into two subsets of even and odd projec- 
tions, which are called "layers" (see Fig. [2|. Inside each 
layer, the projections commute because they are nonin- 

dcf 

tersecting. Consequently, Hodd = Pi ■ P3 ■ P5 ■ ■ ■ and 

dcf 

Ileiien — P2 ■ Pi ■ Pq ' ' ' ^rc the projections into the com- 
mon ground spaces of the odd and even layers, respec- 
tively. Then, by the DL, the operator A =^ neuenllodd 
is an approximation to the ground-state projection. It 
preserves the ground state while shrinking its perpendic- 
ular subspace by an n-independent factor Ao(e) ~ 1 — ce, 
where c is a geometrical factor. Moreover, each applica- 
tion of A increases the Schmidt rank of our state by a 

constant factor of, at most, Dq =^ cP (due to the projec- 
tion that intersects with the cut in the chain; see Fig. |2]) . 
Unfortunately, we generally expect • Aq > 1, so the 
operator A does not, by itself, suffice to carry out our 
plan. 

To construct the operator K we need several new ideas. 
First, we observe that Dq and Aq can be replaced by Dq 
and Aq, respectively, by coarse graining: Fuse k adja- 
cent particles, making them a single particle of dimen- 



sion d'^. Although this only increases the value of the 
product, it creates room for the next step, which is to 
modify the operator A to decrease the factor by which 
it blows up the Schmidt rank. For concreteness, assume 
that the even layer contains the projection that intersects 
with the cut. We will focus on a segment of m projec- 
tions around the cut and denote their product by n,„, 

so Tleven = ^m^rest- Wc will rCplaCC the Operator Heven 

with Hm^rest that closely approximates Tleven while in- 
creasing the Schmidt rank by much less than Dq (when 
amortized over several applications). 

One of the great benefits of using the DL is that the all 
projections in a given layer commute, and, hence, much 
of the following analysis becomes almost classical. In- 
deed, the m projections around the cut {Pi}™ i define a 
decomposition of the Hilbert space of the system into a 
direct sum of 2™ eigenspaces, called sectors. Each sector 
is defined by a string s = (si, . . . , Sm), such that if \ijjs) is 
in the s sector, P^lV's) = (1 ^ Si)\tljs)- A site with Si = 1 
is called a violation, since it corresponds to a non-zero 
energy of the corresponding local Hamiltonian term, and 
X]i=i the total number of violations in the sector s. 

Now, an arbitrary state \ip) can be decomposed as 
\ip) — \ipo) + iV'i); where \ipo) is its projection on the 
zero violations sector and jV'i) is its projection on the 
violating sectors. Clearly 11^1^') = |V^o)- To approxi- 
mate this behavior, we will use the {Pi} projections to 
construct an operator 11^ that is diagonal in the sectors 
decomposition, and in addition IIm|^) = |V'o) + IV''i); with 
IV^'i) in the violating sectors and HV'iir < <^ll^i|| • It fol- 

^ dcf ^ 

lows that the operator A = HmTlrestTiodd approximates 
dd in the sense that A\i}) = A\rt-^) e "H-"-, 

and < {A!^ + S)\\n^f . Let A =^ Ag + (S. 

To construct the operator first consider the op- 
erator N = J27Li{^ ~ Pi)- The operator N counts the 
number of violations in a sector: If |?/'s) belongs to the 
s sector, Nji/js) — \s\ ■ \ips). The operator II„i will be a 
polynomial in N, with the polynomial evaluating to 1 on 
\s\ = 0, and less than 5 on input with \s\ between 1 and 
TO. Three ideas play a critical role in the construction of 
this polynomial and in bounding the increase in Schmidt 
rank. The first is the use of a Chebyshev polynomial, 
which achieves the desired behavior at the to -I- 1 points 
with a degree of only j — 0{y^m\ogS~^). The second 
idea is that it suffices to bound the entanglement across 
any of the m cuts and then to pay a further penalty of, 

at most, Dj (Dq)™ to bound the entanglement across 
the cut of interest. So, if we consider the operator A^, 
each term has degree j£ (i.e., is a product oij£ of the Pis), 
and so the typical cut is crossed ji/m times, resulting in 

a Schmidt rank increase by (Dg)^'^/™ ~ D^^'^. This 
means that the incremental Schmidt rank per application 
of a term of A is Dq^^^, which can be made arbitrarily 
small by choosing to to be large enough. Finally, a re- 
cursive grouping argument shows that we do not have to 
pay a price in Schmidt rank proportional to the number 
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of terms in the polynomial (which would have been catas- 
trophic); instead, we can decompose the operator as 
a sum of only 2*-^'^'°^ operators, for each of which there 
is a (possibly different) cut with entanglement increase 

of ~ D^o'^^- 

Putting it all together, we have an operator K — A^, 
which increases the Schmidt rank hy D = DjD^, where 
f) ^ 2^(^°s^^)u^/^^ and j = 0{^/rnlog^-^) and 
achieves a shrinkage factor of A = for A = Aq +6. It 
is now a matter of simple algebra to fix the parameters 
TO, S, k and I such that D-A < 1/2. The end results turns 
out to be logD = 0(1) • ■ log^X for X = (logDo)/e, 
and this completes the proof. 

Paper Organization: 

We begin with some preliminary definitions and known 
results from mathematics and quantum information in 
Sec. [n] In Sec. |III| we give the proof of our main result. 
Theorem |I.1[ The heart of the proof, which is its most 
technical part, is the diluting lemma. It is proved sep- 
arately in Sec. |IV] In Sec. |V] we provide an outline for 
our entanglement bound in 2D and beyond (the "almost 
volume law" result), and in Sec. VI we sketch the implica- 



tion of our area law to the existence of a matrix product 
state approximation for the ground state. In Sec. |VII| we 
offer our summary and conclusions. 



II. NOTATION AND PRELIMINARIES 

Throughout this paper log(-) will denote the base 2 
logarithmic function. 



reader. We define Pi to be the projection on the ground 

space of Qi, Pi = 1 — Qi. The assumption that H is 
made of projections is not actually a restriction, as is 
demonstrated, for example, in Sec. 2 of Ref. [14] Indeed, 
given a general frustration-free system with a spectral 
gap e > and interaction strength bounded by < J, 
one can always pass to an equivalent system that shares 
the same ground space, which is made of projections and 
has a spectral gap e/J. Therefore, throughout the paper, 
we will drop the J dependence; we will assume J = 1 and 
work in dimensionless units. 



B. The Schmidt decomposition 

Given a state |(/)) and a bipartition of the system to 
two non intersecting sets, R and i, with corresponding 
Hilbert spaces 'Hl,'Hr such that V. = 'Hl®'Hr, we can 
consider the Schmidt decomposition of the state along 
this cut: 10) — \j\Lj) ® \Rj)- Here Ai > A2 > . . . are 
the Schmidt coefficients. Their squares sum 1 (if |(/)) is 
normalized) and are equal to the non-zero eigenvalues of 
the reduced density matrices of either sides of the cut. 

The number of nonzero Schmidt coefficients in the 
Schmidt decomposition of |0) is called the Schmidt rank 
(SR), which we shall denote as SR(0). The usefulness 
of the SR stems from it being a "worst case" estimate 
for the entanglement. As such, it is often easy to bound. 
The following facts are easy to verify: 

Fact II. 1 

1. SR(0 + i))< SR(0) + SR(V'). 



A. Local Hamiltonians 

We consider a A:-local Hamiltonian H acting on H = 
((D'')'*", the Hilbert space of n particles (spins, qudits) of 
dimension d that sit on a _D-dimensional grid. Our main 
result concerns the ID case with D — 1, but in Sec. [V|we 
will consider higher dimensions. We assume H — Hi 
where each Hi is a non-negative bounded operator that 
acts non-trivially on a constant number of k particles 
(hence the term local Hamiltonian). We further assume 
that H has a unique ground state \n) with ground energy 
and has constant spectral gap e > 0. Since the Hi are 
all non-negative, the ground state must be a common 
zero eigenstate for each of the Hi] a Hamiltonian with 
this feature is known as frustration free. We denote by 
fL'^ C H the orthogonal complement of the ground space 
of H. Thus, H'^ is an invariant subspace for H and 

H\-H± > el . (4) 

We further assume that the Hi terms are projections, 
and, henceforth, we will denote them by Qi to remind the 



2. If O is a k-local operator whose support intersects 
both H-L o,nd "Hr, then it can increase the SR (with 
respect to the bi-partitioning) of any state by, at 
most, a factor of d'' : SR(O0) < d'=SR(0). If O 
intersects only one part of the system, its action 
cannot increase the SR. 

3. Consider a ID system. If ri and rj are the SR of 
10) that correspond to cuts between particles i, i + 1, 
and j,j + 1, then d^'^^^Vj < r,j < d'^^^Vj. 

An important fact about the SR is the following corol- 
lary of the Eckart- Young theorerrP^I, which states that 
the truncated Schmidt decomposition provides the best 
approximation to a vector in the following sense: 

Fact II. 2 Let {ip) be a vector on a bi-partitioned Hilbert 
space Hl ® "Hr, md let Xi > X2 > . . . be its correspond- 
ing Schmidt coefficients. Then the largest inner product 
between \ip) and a normalized vector with Schmidt rank 
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C. The Detectability Lemma 

One of our main technical tools in this paper is the 
Detectability Lemma (DL). Originally proved in Ref. [T51 
in the context of quantum constraint satisfaction and 
promise gap amplification, a simpler and stronger ver- 
sion of the DL was proved in Ref. [T31 where it was used 
in the context of gapped, frustration-free local Hamilto- 
nians. This is the form that would be used here. To 
state it, consider a gapped ID frustration-free Hamilto- 
nian with nearest-neighbor interactions that is defined on 
a chain of n d-dimensional particles, 



H 



i=l 



(5) 



As explained previously, we assume that the 2-local inter- 

dcf 

actions terms Qi are projections, and we set Pi — 1 — 
to be the projection of the local ground space of every 
term. 

We partition the projections into two sets which we 
call layers: the odd layer Qi, Qs, Q5, . . . and the even 
layer Q2, Qa, Qe-, ■ ■ ■■ Within each layer the projections 
are non-intersecting and, therefore, are commuting. It 

follows that Hodd '= Pi • P3 ■ -P5 ■ • • and Ueven '= -P2 • Pt • 
Pe ■ ■ ■ are the projections into the ground space of the 
odd and even layers respectively. An illustration of this 
construction is shown in Fig. |2] We then have: 

Lemma II. 3 (Detectability Lemma (DL) in ID) 

Let A '= HoddJieven, 0-nd let Ti,^ he the orthogonal 
complement of the ground space. Then 



< Ao(e) = 



(e/2 + 1)2/3 



(6) 



For brevity, we will often drop the e dependence, and 
simply write Aq. We note that the DL is not restricted 
to the ID case, and can be easily generalized to any di- 
mension by using more than two layers. 

In the commuting case, Tlf,^en and Tiodd commute with 

def 

each other, and their product, A = Hodd ■ ^even is the 
projection into the ground state of the system. Gener- 
ally, however, they do not commute, and as a result A 
is only an approximation to the ground space projection: 
It leaves the ground space invariant while shrinking its 
perpendicular space by some factor. The DL quantifies 
this approximation: It tells us that the shrinking factor is 
bounded away from 1 by a constant Ao(e) that depends 
on the spectral gap and not on the system size. 



D. Chebyshev polynomials of the first kind 

We recall some basic facts about Chebyshev polyno- 
mials of the first kind. More information can be found 
in Chapter 22 of Ref. i20]or in any standard text book on 



approximation theory. This is a family of polynomials 
{Tn{x)}, where n denotes the degree of the polynomial. 
They are solutions of the ODE 



For < 1 they are given explicitly by 



(7) 



Tn(2:) =^ cos(ncos ^ x) , (8) 

and, for general x, they are given recursively by 

To{x) - 1 , (9) 
Ti{x)=x, (10) 
Tn+i{x) = 2xTnix) - T„_i(x) . (11) 

By the recursion relation, the coefficient of x" in T„ (x) 
is 2"~^, and by Eq. ([s]), it has n simple roots (Cheby- 
shev nodes) in (—1, 1), Xk — cos (f ^^^) ,k — 1, . . . , n. 
Therefore, 



Tr,{x) = 2"-'l[{x-Xk) 



(12) 



fe=l 



We conclude this section with an inequality about the 
behavior T„(x) near x = — 1 that will be used in the 
proof of the diluting lemma. From Eq. ([8| it follows that 
for all -1 < x < 1, |T„(x)| < 1 and that r„(l) = 1; 
7ri(— 1) = (—1)". Finally, let us look at the derivative of 
r„(x) at X = — 1: substituting x = — 1 at Eq. ([T]) gives us 
r;(-l) + n2r„(-l) = and so T^(-l) = -nM^„(-l) = 
(— l)"+^n2^ Since T„(x) has all its minimas/maximas 
inside [—1, 1], its derivative T^(x) outside that region can 
only grow in absolute value, and, therefore, for all 6 > 0: 

\Tni-l - S)\ > |T„(-1)| + <5|T^(-1)| >l + nH. (13) 



III. PROOF OF THE MAIN THEOREM 

Recall from the outline of the proof presented in the 
introduction, that our goal is to construct an operator K 
whose effect is to rapidly increase the overlap with the 
ground state, while only slowly increasing the Schmidt 
rank. We formalize this property of K in the notion of 
a {D, A)-AGSP below and then show that if the trade- 
off between D and A if favorable, i.e., D ■ A < 1/2, 
then we can show that there is a product state that has 
large overlap with the ground state, which, in turn, leads 
to a bound on the entanglement entropy of the ground 
state. Once this is established, we can move on to the 
central construction of this work, which is performed in 
the diluting lemma |III.5| This is where an operator with 
the proper trade-off of D and A is constructed. 

We begin with a quantitative definition of an operator 
that moves any vector toward the ground state. 
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Layer 1 (even) 
Layer 2 (odd) 




FIG. 2. The settings of the detectabihty lemma for a ID system with 12 particles H = X^^lj Qi- The local interaction terms, 
Qi, are divided into two layers: the even layer and the odd layer. The dashed red line denotes a cut in the system between 
particles 6 and 7. 



Definition III.l An Approximate Ground- Space 
Projection (AGSP) 

Consider a local Hamiltonian system H — Hi on 
a ID chain, together with a cut between particles i* and 
i* + 1 that bi-partitions the system. We say that an opera- 
tor K is a {D, A) -Approximate Ground Space Projection 
(with respect to the cut) if the following holds: 

• Ground space invariance: for any ground state 

K\^) ^ \VL). 

• Shrinking: for any state G also 
K\n^) e H^, and \\K\n^)\\^ < A. 

• Entangling: for any state \(p), SR{K\(j))) < D ■ 
SK{(f)), where SR(-) is evaluated with respect to the 
bi-partitioning. 

We refer to D as the SR factor and A as the shrinking 
factor. 



decomposition can be written as \v) = X^iLi K\Li)\Ri 
It follows that 



We note that with this definition, the DL combined with 

yr ■ A X dcf 

iModd 



Fact 
id' 



II.l 



implies that A = Ug 



is a {Do,Ao) 



(1 + e/2)-2/3)_AGSP. In its bare fo rm, however, this 
operator is not useful to us since its SR factor is too 
large with respect to its shrinking factor. Specifically, 
it turns that the important quantity to consider is the 
product D ■ A. The following lemma shows how having a 
{D, A)-AGSP with D ■ A < ^ implies the existence of a 
product state whose overlap with the ground state is at 
least 1/V2D. 

Lemma III.2 // there exists an {D,A)-AGSP with D ■ 
A <\, then there is a product state \<f>) = \L)(^\R) whose 
overlap with the ground state i^^ n = {^\(f>) > 1/V2D. 



Proof: Let K he a {D, A)-AGSP with D 

dcf 



A < i and 



\4)') a product state = |L') (g) \R') whose overlap 
with the ground state is /i = {fl\(t)') < 1/V2D. Below 
we show that there exists another product state with a 
larger overlap with the ground state. 

By assumption, we can expand {(p') as = /i|ri) + (l — 

lj.^y/^\n^),with\n^) e H^. Let Since 
is an (D, A)-AGSP, it follows that + Si\nj^) 

with e n-^, Sf < A, and SR(0i) < D. 

Setting \v) = ^^|(/>i) to be the normalization of 
we have SR(i;) = SR(0i) < D. Therefore, its Schmidt 



D 



\{n\v)\ 



<J2^^\{^\L^)\R^) \ < 



4=1 



D 



where the last inequality follows from Cauchy- Schwartz 
and the fact that J^i -^f = 1- Therefore there must be an 
i for which 



m\L,)\R,)\^ > 



^1101 



> 



D{n^ + Sj) - D{n^ + A) ■ 



But since D ■ A < 1/2, and, by assumption /i < yj^, it 

follows that Z)(/i^+A) < 1, and so the overlap of \Li) \Ri) 
with the ground state is larger than /i. ■ 

With this bound in place, we start from the product 
state with the maximal overlap with the ground state, 
and use any AGSP to obtain controlled approximations 
of the ground state, from which an upper bound on its 
entropy can be found. A very similar argument was used 
in Hastings' proof of the ID area law.^'^ 

Lemma III. 3 // there exists a product .state whose over- 
lap with the ground .state is at least fi, together with a 
{D,A)-AGSP, then the entanglement entropy of is 
bounded by 



(14) 



The proof can be found in the appendix. The brief 
overview is that we begin with the asserted product state, 
and repeatedly apply the AGSP to it. The result is a se- 
ries of vectors with increasing SR by a factor D each time, 
that approach the ground state at a rate quantified by 
powers of A. Using these vectors and the Young-Eckart 
theorem (Fact II. 2) provides adequate upper bounds for 
the high Schmidt coefficients of the ground state to bound 
the entropy. 

Lemmas |III.3| and |III.2| can be combined to give 

Corollary III.4 // there exists an {D, A)-AGSP such 
that D ■ A < ^, the ground state entropy is bounded by: 



S<Oil)-\ogD 



(15) 
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We are left, therefore, with the challenge of designing 
an operator K which is a (D, A)-AGSP with A < 1/2. 
This is the driving construction of this work. It is stated 
in the following lemma, and is proved in the next section. 

Lemma III. 5 (The diluting lemma) Consider a ID 
gapped frustration- free Hamiltonian, with a spectral gap 

e > and particle dimension d, and define X . 
Then for any cut in the chain there exists an (D, A)- 
AGSP with £> • A < 1/2 and 



logD < 0(1) • log^X 



(16) 



Substituting the result of this lemma in Corollary |III.4| 
proves Theorem |LT| 



IV. PROVING THE DILUTING LEMMA 



(LEMMA III.5I 



We will prove the diluting lemma, by modifying the DL 

dcf 

operator A — Ileve7i^odd to a new operator A, which is 
not an AGSP, but has similar properties (see Dcf. III.ll 



• Ground space invariance: for any ground state 

A\n) = \n). 

• Shrinking: for any state G H^, also A\il^) e 
n^, and < A. 

• Entangling: for any state \(j>), and any integer 
i > 0, SR(i^|<^)) < DiD'^ ■ SR(</.). 

• The parameters Di,D,A satisfy 
1 



D- A< 



2 ' 



logD < log(A-i) < 0(1) -log^AT 
log Di < 0(1) • X^log^X . 



(17) 

(18) 
(19) 



Having A in hand, we can choose £o ~ [log_D/] and 
obtain the {D, A)-AGSP 



K A'« 



(20) 



with (D, A) = (D/D^", A^°). It satisfies £>• A < 1/2 and 
logD = 0(1) • log Di - logD < 0(1) • X^ log** X . 
We now proceed to define the general form of A. 



Therefore, our construction of A will modify Hevem leav- 
ing Uodd intact. 

We begin by considering the piece of lieven consisting 
of the m even projections closest to the cut. We denote 
this set of projections by /,„, and abusing the notation a 
little, we relabel them by Pi, ... , P,„ and define 



dcf 



(21) 



4=1 



the projection into the common ground state of these 
projections. Note that in this notation the cut inter- 
sects with Pi-, where i* — [to/2] (see Fig. [S]). Denoting 
by Tlrest the product of all the remaining projections in 
Iletienj wc havc fle^en — !!,„ • Hrest- Wc wiU approximate 
Tim by an operator Hm, and then define A by 



A fir. 



■ n 



odd 



(22) 



Analysis of the amount of entanglement created by pow- 
ers of A will focus exclusively on the structure of tlm 
since Tlrest i Tlodd do not increase the SR along the cut. 

One of the great benefits of using the DL is that the 
all projections in a given layer commute, and hence the 
analysis becomes almost classical. Indeed, as we said in 
the outline of the proof, the projections in Im define a 
decomposition of the Hilbert space of the system into a 
direct sum of 2™ eigenspaces, called sectors. Each sector 
is defined by a string s = (si, . . . , Sm), such that if |V's) 
is in the s sector, PilV's) = (1 — Si)\^s}- A site with 
Si = 1 is called a violation, since it corresponds to a 
non-zero energy of the corresponding local Hamiltonian 
term. We denote by \s\ — X^Zli *i the total number of 
violations in the sector s. Finally, we will also consider a 
coarse-grained decomposition in which we group together 
all (™) sectors with k violations. The direct sum of these 
subspaces is called the fc-violations sector. 

Consider now an arbitrary state \4>) and expand it as 
IV') = IV'o) + IV'i): where IV'o) is its projection on the zero 
violations sector and {ipi) is its projection on the violat- 
ing sectors. Clearly, Tlm\i^) = {tpo)- To approximate this 
behavior, we will use the {Pi} projections to construct an 
operator Tim that is diagonal in the sectors decomposi- 
tion, and in addition Hm I fA) = IV-'o) + |V'i)i with \ip[) in the 
violating sectors and < '^llV'ill^ foi" some error pa- 

rameter S > 0. Then it is easy to verify that A|J7) = jJl), 

A\n^) e n^, and < (A-K(5)|lf^^f . This gives 

us 



A = Ao + (5 . 



(23) 



A. General settings 

Without loss of generality, we assume that the bi- 
partitioning cut in the chain intersects with an even 
projection (see Fig. [2]). As a result, when applying A, 
only the Tleven portion of the operator increases the SR. 



B. Constructing tlm — a general discussion 

Before actually constructing the operator Tim , it might 
be useful to describe what we hope to accomplish. Recall 
that we would like to show that the increase in SR due to 
application of A^ is bounded by DjD^, where D ■ A < ^. 
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The rough idea is to show that there is sorae cut between 
particles i and i+1 within the support of 1^ such that the 
SR across this cut does not grow much due to apphcation 
of A^. This would account for the factor in the above 
bound. Moreover, by Fact the SR across the middle 
cut + 1 can only be larger than the SR between 

i, i + 1 by a factor of at most < D™ =^ Dj. 

There are several approaches to constructing Per- 
haps the most obvious one is by "diluting" ; instead of us- 
ing a product of m projections, we may use a product of 
rm randomly chosen projections for some < r < 1. Af- 
ter applying such 11^ for £ layers, there would be columns 
with fewer than r£ entangling projections, and so the SR 
along these columns will be bounded by Dq^. Apply- 
ing Fact |II.1| as described above, the SR in the middle 
cut would be at most Dq^D™, implying D = Dq and 
Di = D^. 

What is the A factor of such construction? Intu- 
itively, sectors with high number of violations are easier 
to "catch" because there is a higher chance of collision 
between one of their violations and the rm projections. 
Indeed, on average, the mass in the k sector is shrunk by 
at least C"-*^) / ( ™ ) < (l - r)'=. But this means that the 

V rm J I \rm/ — ^ ' 

low-violations sectors, and in particular the one- violation 
sector, are barely shrunk. The latter can be shrunk by 
as little as 1 — r, which means that A = Aq + \ — r. This 
can never get us to Z) • A < 1/2. 

To overcome this problem, we take a different approach 
for the construction of n„i using the following operator: 



Definition IV. 1 (The N operator) 



fortunately, however, it blows up the mass of the high- 
violating sectors \s\ > k. One possible solution is there- 
fore to use the IB^ operators in conjunction with the di- 
luting approach. This was indeed the approach taken 
in a previous papei'^, and it might prove beneficial in 
other contexts. Here, however, we use a much simpler 
solution that relies solely on the N operators by utilizing 
the Chebyshev polynomials. 



C. Constructing Ilm using Chebyshev polynomials 

To construct entirely from W, we want a polyno- 
mial of minimal degree P{x) such that P(0) — 1 and 
|P(x)p < (5 < 1 for every integer 1 < x < m. In such 

case, ilm '= f'(N) will leave the zero-violations sector 
intact, while shrinking the other sectors by 5. One naive 
approach is to take P{x) — {l — x)-{l — x/2) ■ ■ ■ {1 — x/m). 
This gives 5 = at the price of a polynomial of degree to, 
the same degree as the original 11^ (thereby creating too 
much entanglement). For a lower degree polynomial with 
the desired properties we turn to the Chebyshev polyno- 
mial, a central object in approximation theory. As noted 
in Sec. |IID[ the Chebyshev polynomial of the first kind 
Tn{x) oscillates between —1 and 1 in the region [—1,1] 
and then increases rapidly outside that region. The idea 
is therefore to use a polynomial that is the mapping of the 
Chebyshev polynomial from [—1,1] to [1, to] and rescaled 
to be 1 at a; = 0: 



N 



def 



(24) 



Definition IV.2 (The C,„(a;) polynomial) The 

Cm{x) polynomial is a ^/m-order polynomial that is 
defined by 



The operator N counts the number of violations in a 
sector: if belongs to the s sector, N|?/;s) = \s\ ■ [1/)^). 

We can use the N operator to annihilate the mass in 
the low-violations sectors. Consider, for example, the 

operator IBk 1 — j;N. When acting on \ips) we get 
2 



IjlBfclV'.)!!' = (1 - xj 11^-11'- Therefore Bfe completely 
annihilates the mass in the k sector, while leaving the 
violations sector intact. In addition, as we shall see in 



Sec. IV D it does not create much entanglement. Un- 



def 



m+1 



2 

L-m (") 



(25) 
(26) 



Figure H shows Cm{x) for to — 36. It is easy to verify that 
1) C„(0) = 1, and 2) \Cm{x)\'^ < | for every 1 < a; < to. 
The first claim follows from definition, while the second 
follows from the fact that [(^^(O)! > 3, which follows 
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from Eq. (13 I: 



^ \ m — 1 



(27) 



> 1 + TiVmf > 3 . 

m — 1 



Notice that Cm{x) is a polynomial of degree -^/m - a 
huge improvement over the naive construction of degree 
TO. In fact, it can be shown that this construction is op- 
timal: as shown in Ref. i23i any polynomial that satisfies 
the above two properties must be of at least degree 

To improve the shrinking factor, we can apply 
[Cm (a;)]'', where q is a parameter to be determined later. 
We can now define the operator A that appears in the 
diluting lemma: 

Definition IV. 3 (The Chebysiiev-based operator A) 

Given integers m and q, the Chebyshev-based operator A 
is 



A =^ fi™ • n.,,, • n, 



where 



dof 



(28) 
(29) 



We note that 11^ is a degree j — q^/m polynomial in IN, 
which leaves the zero- violations sector intact and shrinks 
the violating sectors by at least S = (1/9)''. Conse- 
quently, by Eq. (Ei), we get A = Aq + (1/9)'?. 



Let us now turn to the task of upper bounding the SR 
that is generated by A^. 



D. Upper bounding the SR factor of A^. 

We bound SR generated by A^ in the following lemma: 

Lemma IV. 4 Set j q^/rn, the degree of [Cm{x)Y- 

(30) 



Then for any state \(j)) . 

SR(iV) < Dib'^ ■ SR(</.) , 



where 



(31) 
(32) 



Proof: Recall that n,„ is a polynomial of degree j in N 
and therefore can be written as a sum of j -I- 1 terms: 



(33) 



1=0 



Consequently, A^ can be written as a superposition of 
(j -I- 1)^ terms of the form 



with ii, . . . ,ig between and j. When applied to a gen- 
eral state 10), the term that potentially generates the 
highest SR is (N-' • Urest^oddY ■ We will therefore up- 
per bound the total SR by upper bounding its SR and 
multiplying the end result by (j + 1)^. 

Definition IV. 5 (A min-entangling operator) We 

say that an operator C is min-entangling with respect to 
some cut in the support of Im if it is of the form 

Mi^ ■ (nrest^odd) ' ' ' Mi^ ■ {HrestJiodd) ' ■ (HrestJlodd) , 

where Mi are polynomials in the projections of Im such 
that 

1. There exists a subset of at most jl/m of the Mi 's 
that contains the projection P € Im that intersects 
with the cut. 

2. Each of the rest of the Mi only contains projections 
that are either strictly to the left of P or strictly to 
the right of P (in Im)- 

It follows that only those Mi that contain the "entan- 
gling" projection P increase the SR across the cut, and 
therefore the total SR increase is bounded by D^^^^ . The 
bound on the SR follows from the following decomposi- 
tion of A^: 

Claim IV. 6 (IN-' • ^rest^oddY can be written as a sum 
of at most [4(j + 1) • ij/2 + 1) • (j/4 + 1) • • • (1 + 1)]^ 
min-entangling operators. 

Proof: Say that an operator C is t-min-entangling if 
there is a contiguous interval I Im of to/2* projections 
such that C is of the form 

Mi^ ■ {Urest^odd) ' ' ' Mi^ ■ {Hrest^odd) ' Afii ' {^rest^odd) , 

where a subset of at most j^/2* of the M^'s are equal to 

dof 



N 



Pi, while each of the other M^'s is made 



of projections that are either to the left of / or to the 
right of I (in Im), and in particular does not include any 
projection from I. 

The proof relies on a recursive construction that af- 
ter t rounds decomposes (N-' • Hrest^oddY ii^^o a sum 
of t-min-entangling operators. After log to rounds of re- 
cursion we end up with the desired decomposition into 
min-entangling operators. 

We begin by splitting N into two terms: N = Ni,-|-Njf, 



dcf 



where = ZTJii^ " ^0 and N« = E"™/2+i(l " 
P,). Then N^' = (N^ -f Nfl)^" = J^l^^ {{)W-'Wj^, Ex- 
panding across all £ layers, we end up with {j-\-lY terms. 
Let us focus on one such term, and assume that at layer 
i it has the powers N]^~"'N^*. Since the total degree in 
Ni and N^^ across the i layers is j£, it follows that the 
degree of one of the two across the £ layers must be at 
most j£/2, and hence every term is 1-min-entangling op- 
erator. Consequently, O^-'^rest^oddY can be written as 
a sum of at {j -\- lY 1-min-entangling operators. 



N^^ • (Ilrestllodd) • • - N'^ • (IlrestTlodd) ' N'^ • (H^estU 



odd ) 
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FIG. 4. The polynomial C3q(x), which is based on the Chebyshev polynomial of the first kind Tq(x). For x G [1, 36] we have 
\Ci6(x)\ < 1/3. 



Proceeding recursively, we now pick one such term. It 
either contains at most j£/2 Nl operators or at most 
j£/2 N/j operators. Assume without loss of generality it 
is Nl. We write Ni=Ni,L+NL_R, and consequently, 
every NJ' can be written as the sum of + 1 terms: 
N^' = ¥!ll+n,¥!ll^¥!LR + ...+¥!l'ii. Therefore, upon 
opening the product, we obtain a sum of rii=i('^i + 1) 
terms such that each term has either a maximal degree of 
ji/4: of f^LL or of N^fl. These are all 2-min-entangling 
operators. Moreover, it is now easy to verify subject 
to the constraint '^j — i-^/^j the total number of 

terms Jli^il^j ~^ 1) maximized when all n.t are equal: 
ni=i("i + 1) — + 1)^- To summarize, we have just 
shown that (S^-'Ilrest^oddY can be written as a sum of 
at most {j + l)(j/2 + 1) 2-min-entangling operators. 

Continuing in this fashion for logj rounds we end 
up with a total of at most [{j + 1) • (j/2 -I- 1) • {j/A + 
1) • • • (1 -I- 1)]^ log j-entangling operators. At this stage, 
the total degree of each such operator in N/ is at most 
j£/2^°^^ = £. To bound the increase in the remaining 
log(m/j) rounds, we observe that subject to the con- 
straint — ^/'^'^ the expression ni=i("'j + 1) 
maximized when £/2'^ of the n^'s are 1 and the rest 0: 

nLi("» + 1) < 2^^^ • It follows that the total increase 
in the number of terms over these rounds is bounded by 
2^2^/22^/4 . . . < 4^. This completes the proof of the claim. 



(which is always the case), 

4(j + 1)2 . ij/2 + 1) • 074 + 1) 
< 20j^/2 23'°s'j . 



(1 + 1) 



The SR contribution of each such min-entangling op- 
erator at that cut that passes through its diluted col- 
umn is at most D^^^™, and since this column is at 
most m particles away from the bi-partitioning cut, it 
follows from Fact III. II that its SR contribution to the 

Therefore 

(34) 
(35) 



Jl/m 



bi-partitioning cut is at most Z3q 
SR(i^|0)) < Dib'^ ■ SR(0), with 

b^2Qf/^2'^'°^"Wl'"' . 
This concludes the proof of Lemma |IV.4| 



At this point it is worthwhile to pause and take inven- 
tory of where we are. We have shown that the operator 
is an AGSP with characteristic factors {DjD^, (Ag -I- 
(1/9)')^}. We are searching for an AGSP whose product 
of characteristic factors is below 1/2. This will be the 
case in our situation for some £ as long as the product 
D ■ [Aq + (1/9)'^] is less than one. Our work so far has 
shown 



D 



[Ao + (l/9)«] = 20j-'^/22iiog^-£,j/™^^g ^_ (1/9)5 



(36) 



We can now finish off the proof of the main lemma 
of this section. Claim [TVT6| gives a decomposition of the 
operator (N^ • Urest^oddY as a sum of at most [4(j + 1) • 
(j/2 -I- 1) • (j74-|- 1) • • • (1 + 1)]^ min-entangling operators. 
As noted above, to apply this result to A*^ we must further 
multiply this number by {j + 1)^, so A^ can be written 
as a sum of no more than [4(j -I- 1)^ • {j/2 + 1) • (j74 -|- 
1) • • • (1 -f 1)]^ terms. It is easy to verify that for j > 2 



Recalling that j = q\/m it is clear that we can find con- 
stants q,m (in Do) such that D ■ (1/9)*^ < 1/2; however 
the term D ■ Aq may still be bigger than 1. 

To solve this problem another idea is needed: coarse- 
graining. As we shall see in the next subsection, fus- 
ing together k adjacent particles allows us to move to 
a new local Hamiltonian system with Dq,Aq replaced 
by D^,A^. Taking fc = O((?log A"^), would then yield 
Al^ ~ (1/9)"?, and consequently lead to £) • A < 1/2. 
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n — II 

Pi P'i Pi 



FIG. 5. An illustration of a fc-coarse grained system in 
ID with fc = 4. The elongated rectangles denote the coarse- 
grained projections and the ovals denote the original projec- 
tions. Underneath even coarse-grained projections, one can 
"pull" a pyramid of original projections, and similarly above 
an odd coarse-grained projection. Together, they form k lay- 
ers of the original projections. This shows that the coarse- 
grained shrinking exponent is actually Ag = Aq. 



In particular, let PyVi denote the "pyramid" or 
"light-cone" of original projections pulled from P/ 
as in the figure above. Then H^^^ — HJ^^^ • 
Pyri • Pyrg • Pyrg • • • , and n^„g„ = Pyr2 • Pyr4 • 
PvrK • • • n' 

^ J^b even 

2. Pyri • Pyrg • Pyrg • • • Pyi^ ■ Pyr^ ■ Pyrg • • • = 

(TT TT \k + l 

y^^odd^^even J 

This is due to their pyramid-like shape, they can 
be commuted past each other and re-arranged as 
k+1 layers of the original projections (see the figure 
above) . We note that applying k + 1 original layers 
of the DL shrinkj231 the perpendicular space by a 
factor of An'. 



F. Gluing it all together 



E. Coarse-graining 

Consider a fc-coarse-grained system, in which we fuse 
together k adjacent particles, making them a single parti- 
cle of dimension d'^'. The new Hamiltonian of the system 
would now be a 2-local Hamiltonian on a chain, consist- 
ing of projections Q'^ = 1 — P/, where P/ denotes the 
projection into the common ground space of the 2k par- 
ticles that form the coarse grained particles i and z -I- 1. 
We define the odd/even layers projections H'^^^, ^'even ac- 
cordingly, and notice that every application of ^'everJ^'odd 
increases the SR by at most a factor of Dq. To esti- 
mate the shrinking factor, we may use the DL on the 
new Hamiltonian using the new spectral gap. 

However, there is a much stronger bound that we can 
obtain by using the DL on the original system. Based on 
an idea that has already appeared in a related form in 
Ref. [131 we show that: 

Claim IV.7 

TT' IT' 

odd even 

TT' (Y\ TT TT TT ^^f^ TT' 

^^odd ' \^^eve7i^^odd^^odd^^even ) ' ^^even ' 

This immediately implies: 

Corollary IV. 8 // H' — J^i H'i is the k-coarse-grained 
version of H = Hi, then the SR and shrinking factors 
of the DL AGSP of H' are related to those of H by 



K,a;,) = (z?^-,Ao^) 



The claim follows from two observations: 



(37) 



If we first fc-coarse-grain the system, and then con- 
struct A, we obtain the following factors: 



(38) 
(39) 
(40) 



A = a;; + (i/9)« . 



We have 3 free parameters: m, q, k (recall that j — q^/m). 
To finish the proof we show how these can be chosen to 
obtain D • A < 1/2. 

Our first step is to demand that k is large enough such 
that A§ < (1/8)*. Looking at Eq. (jej), it is easy to verify 
that as long as e < 10, this can achieved by defining, for 
example. 



dcf 



20g 



(41) 



Then A < 2(1/8)'. A sufficient condition for £)• A < 1/2 
is therefore 



^■3/22! log= j^7209j/™2-39 



< 



1 

80 



(42) 



or equivalently, 



\ log J + \ log2 J + AQX^q -3q<- log(80) . (43) 
2 2 m 



where we used the definition X 



dcf lop; d 



Theorem I.l ) 



log -P i 
2e 



- (see 



To satisfy this equation, we demand that 40Ar^ < 2 
so that the leading term in the LHS of Eq. ( 43 ) would be 



-q. Substituting j — qy/m, leads us to define 



1 . For any coarse-grained constraint P/ , we can always 
"pull" from it a product of the original projections 
Pi that act on the support of P/. The reason being 
that P/ projects into the common ground space of 
all the original 2k particles in its support, and the 
original projections Pi are trivial on that space. 



dcf 



TO = {20qXy 



Going back to Eq. ( 43 ) , we now have to satisfy 
3 1 9 

- log j + - log^ j-q<- log(80) . 



(44) 



(45) 
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FIG. 6. A simple settings for a 2D case. / is the length of 
the boundary. 



where now j — q^/m = 20Xq^. 

It is easy to see that this equation can always be satis- 
fied for large enough q, since the logarithmic factors are 
weaker than the {—q) factor. A straightforward analysis 
yields 



q<0(\0g'X) , 

and therefore 

log(A-i)=3g-l<0(log2x) , 

and 

TO = {20Xqf < 0{X^log^X) . 

Then Dj = D^"\ and as logOj^ = 
0{X\og^X), we get 

\ogDi = m log = O {X^ log^ X) 



iOqX 



(46) 
(47) 
(48) 

(49) 



This concludes the proof of Lemma |III.5| 



2D AND BEYOND 



leading power of / from P to and get a bound of 
S<0{l)-P-X^\og^{I-X). 

We first note that up to some unimportant geometrical 
factors, the DL works also in 2D (and in any dimension 
for that matter). See Ref. |14| for more details. For sim- 
plicity, let us assume that also in the present case we have 
only two layers and it is only one layer that increases the 
SR with respect to the cut, which we will still refer to 
as the "even layer". The idea is then to mimic the ID 
case and replace a segment of to columns around the cut, 
which we denote by Im with the operator Ilm ■ The differ- 
ence is that now I„i contains to • I projections instead of 
TO. Therefore, the polynomial that we would use would 
be [Cmi{x)]'^, where Cmi{x) is based on the Chebyshev 
polynomial of degree \JraI. Just as in the ID case, the 
shrinking factor A of A is given by A = Ao -I- (1/9)'. 

What is the SR exponent D of this construction? A 



very similar analysis to that in Sec. IV D can be done 
here: we recursively divide Im by cutting it in parallel 
to the cut. Just as in the ID case, there are m such 
possible cuts in I„i. The difference is that now the SR 
contribution of the restriction of N to a certain cut is not 
Dq but 0(1) •/ • Dq, because N contains a sum of 0(1) • / 
projections along a cut that is parallel to the boundary. 
Therefore, the overall SR after £ layers is: 



with 



SR(i^|0)) <Di D^ , 
Di {D'^') , 



(50) 



(51) 
(52) 



Can Theorem 11.11 be extended to the 2D case and be- 
yond? Currently, we do not have such a proof. Never- 
theless, it is possible to make a small step in the right 
direction, as we sketch in this section. 

For the sake of clarity, we will restrict ourselves to the 
2D case, and consider the case where the bi-partitioning 
of the system is along one dimension, and the length of 
the boundary is /, as depicted in Fig. |6] In terms of the 
discussion in the Introduction, / — |9L|, the area of the 
separating surface between L and L. 

To prove an area-law for this system, one would like 
to show that the von-Neumann entropy between the two 
parts of the system satisfies S < 0(1) • /. 

A straightforward approach for obtaining a bound on 
S, which was also mentioned in Ref. [131 is to treat the 
2D system as a ID system by considering the particles 
along a column as a huge partic le of dimension . Then 
to get a bound from Theorem I.l we replace d i-^ , 

or equivalently, AT / • AT. This gives us S* < 

0{l)-{I-X)Hog'{I-X). 

The above derivation completely failed to take into ac- 
count the local aspects of the problem along the cut. 
We now show how one can make use of it to drop the 



dof 



and j 



qy/ ml. In comparison with the ID indices in 
Eqs. (34 35 1, we see that Dq i-^ in the formula for 



Dj, but Do M- IDq in the formula for D - an exponential 
saving in the latter. 

From here, the analysis follows essentially the same 
steps as the ID case. We perform an initial fc-coarse- 
graining to drive (Do, Ao) 1-^ (-Do, Ag). Demanding that 



dof 



Af5 < (1/8)?, we set k = 20q/e as in Eq. (41 1. Then the 



analogue of Eq. ( 43 1 is 



log J + - log2 J + (log / + 40Xq) ^ - 39 < 



log(80) , 
(53) 



and demanding that (log I+2QXq) — < 2q, leads us to set 



dcf 



( i log / + 20 ATg) 2 . /. Plugging this back to Eq. ( 53|) 
yields g < 0(l)-log^(/-A:), and so m < 0{1)-I-X^ log^/- 
X). This results in logZ?/ < 0(1) • P ■ X^log^{I ■ X), 
and logD < 0{q) < 0(1) • log^(/ • X), which brings us to 



S <0{l) ■ ■ X^ -log^il ■ X) 



(54) 
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VI. MATRIX PRODUCT STATES 

We briefly sketch the imphcations of these resuhs for 
the efhcient approximation of the ground state via Ma- 
trix Product States. Specificafly, Theorem |I. 1 1 impUes the 
following corollary 

Corollary VI. 1 Under the same conditions of Theo- 
rem for any integer k > there exists a matrix 
product state (MPS) |?/'fc) with bond dimension k such 
that 



(55) 



This result follows from first noting that across any cut, 
we can use the properties of the AGSP K to bound the 
norm of the tail of the Schmidt coefficients: X]z>d* — 
< 2DA^, where /i is the overlap of the initial 



product state with the ground state (see Lemma III. 2 
in Sec. |III| and the proof of Lemma [ill. 3| in the appendix 

). Then letting k = D\ we find that if we truncate 
the Schmidt coefficient of a given cut at k, we introduce 
an error of S — ^i>k ^1 — where we used the fact 
that D ■ A < 1/2. "Then applying the MPS construc- 
tion procedure of VidaP^, and truncating the SR across 
all cuts to k, yields an MPS of bond dimension k that 
approximates the ground state to within the accumu- 
lated error n5 — Finally, recalling from Eq. (|l6[) in 
Lemma|lIL5|that log D < 0{1)-X^ log^ X gives EqTsHl). 



VII. CONCLUSIONS 

In conclusions we have given a new proof of the area- 
law for ID, gapped and frustration-free systems. The 
proof uses the DL and the Chebyshev polynomials to up- 
per bound the entropy by 0(1) -X^ log® X, for X = 
which is exponentially better than the bound in Ref. TT. 
It brings us much closer to the recent lower bound of 
0(1) • e^^'^ (for fixed d) of Hastings and Gottesman^^ and 
IranP. 

There are two immediate directions in which one might 
hope to improve this result. First, it is seems very plausi- 
ble that the proof can be extended to the frustrated case. 
Indeed, already in Hastings' ID prooP^, one of the first 
steps of the proof is to reduce the frustrated system into 
an almost frustration-free system by a coarse-graining 
procedure. It is possible that a similar technique can be 
also deployed here. Moreover, one might try to take a 
more direct approach and construct the AGSP directly 
from the Hamiltonian H, by replacing N with H. Both 
are sums of projections, and a similar SR analysis can be 
performed on operators of the form poly(i/). 

The second direction, which is much more interesting, 
and probably more difficult, is to try to generalize the 
area-law for 2D and beyond. In fact, any sub- volume law 
for 2D would be an extremely interesting result. One pos- 
sibility is to improve the log d dependence of the bound 



in Eq. ([s]) . A bound linear in log d would imply an area- 
law in all dimensions, whereas anything like (logd)^"'' 
would imply a sub- volume law for low dimensions. How- 
ever, it seems to us that the right approach is to better 
understand and exploit the locality of the problem in 
the direction parallel to the cut. This was done in a very 
weak way in Sec.jVj and led to an improved bound in the 
2D case. We believe there are more local aspects of the 
problem that can be used. For example, in the current 
AGSP construction we do not assume anything about 
the underlying distribution of violations. Yet, these arise 
from a very specific local operation, namely the appli- 
cation of the previous AGSPs. If one could show that 
the distribution of violations decays exponentially in fc, 
it may be enough to use a Ghebyshev polynomial of a 
degree smaller than y/m - which may lead to an area 
law. More generally, one might want to prove some no- 
tion of independence, or decay of correlation, along the 
cut, thereby reducing the 2D problem to a stack of nearly 
independent ID problems. 
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Appendix: Bounding the entropy of a steps-like 
probability distribution 



Proof of Lemma UlLlJ Let = J2i>l^^\L^) \R^) 
be the Schmidt decomposition of the ground state jJl), 
and let = |L) (g) |i?) a product state such that 
|((/)|f2)| ~ fi. Define the sequence of states \vi) to be the 
normalization of the vectors K^\(f>). Since if is a {D, A)- 
AGSP, it follows that \vg) has the following properties: 

1. SR{ve) < D'^. 

2. \{vtp)\ > 



Then by Fact II. 2 (Eckart- Young theorem). 



X < 



A^(l -^2 



< 1 - 



< = pe 



We will use this bound to upper bound the entropy of 
the {Af}. Choose 4 = [^^1 so that pi^ < 1. For 
£ > 2£o + 1, we we will upper bound the entropy of the 
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distribution with the bounds 



A| < p2i < pe 



This is maximized by spreading out the probabihty mass 
uniformly in each interval + -^^^ich results 

in an upper bound on the contribution of the entropy on 
the interval [D^^ + 1, D^C^+i)] of 



pi log < Pe, log 



< A^-^« lo 



Pi 



Pe 



A^-^«(^-4)lof 



Above, the second inequality follows from p^^ < 1 along 
with the fact that the function x log ^ is increasing 
for a; < 1. 



Summing Eq. (A.l) over £ > 2£q + 1 then yields an 



upper bound of the entropy contribution of this tail as 

jj2{e+i)/ie~eo} 



E 

i>2eo+i 



A^-^«(^-4)log- 



< 



E 

e'>ea+i 



£'A'^' log( 



< 



A 

A , ,D\ 



Above, the first inequality follows from first noting that 
for the choice of £, the exponent on the power of D is 
upper bounded by 4, then making the substitution of 
£' = £ — £q. The second follows from using the series 
equality Ej>i J^^ = 

Combining this estimate with the maximal contribu- 
tion of entropy over the first 1)2(^0+^) terms, namely 

(24 + 1) log Z) = 0(1) • '°fj^^'^ log D, gives the following 
bound on the entropy: 



S<0{1) 



\ 71. TT iog D 

log(A-i) 



A 

(T^A)^ 



log(^) 



To simplify this expression, note that for any inte- 
ger k > 1, is an AGSP with characteristic fac- 
tors (£)'^,A'^). Choosing k = [ jo^^^-ij l ensures that 
\ < A*"' < i. Then substituting the parameters for 
(A.l) ^ ^i^d A*^ for A in the above expression yields 



S<0{l)■[\og{^i'^)\ogD'' 
< 0{1) ■klog{ti-^)logD 



log D 



4k 
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